Dong Yu

THRD: A Training-Free Multi-Turn Defense Framework for Jailbreak Attacks on Large Language Models

Jun 01, 2026
Via arXiv

Do Gender Cues Affect LLM Value Trade-offs? Evidence from a Controlled Decision Benchmark

Jun 01, 2026
Via arXiv

Measure Twice, Click Once: Co-evolving Proposer and Visual Critic via Reinforcement Learning for GUI Grounding

Apr 23, 2026
Via arXiv

Audio-DeepThinker: Progressive Reasoning-Aware Reinforcement Learning for High-Quality Chain-of-Thought Emergence in Audio Language Models

Apr 20, 2026
Via arXiv

Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data

Apr 20, 2026
Via arXiv

SPAGBias: Uncovering and Tracing Structured Spatial Gender Bias in Large Language Models

Apr 16, 2026
Via arXiv

Unlocking Strong Supervision: A Data-Centric Study of General-Purpose Audio Pre-Training Methods

Mar 26, 2026
Via arXiv

Covo-Audio Technical Report

Feb 10, 2026
Via arXiv

Locas: Your Models are Principled Initializers of Locally-Supported Parametric Memories

Feb 04, 2026
Via arXiv

Verified Critical Step Optimization for LLM Agents

Feb 03, 2026
Via arXiv